Data tagging

ABSTRACT

A method for characterizing data elements in an enterprise including ascertaining at least one of an access metric and a data identifier for each of a plurality of data elements and employing the at least one of an access metric and a data identifier to automatically apply a metatag to ones of the plurality of data elements.

REFERENCE TO RELATED APPLICATIONS

Reference is made to U.S. Provisional Patent Application Ser. No. 61/348,829, filed May 27, 2010 and entitled “DATA MANAGEMENT USING DATA TAGGING”, the disclosure of which is hereby incorporated by reference and priority of which is hereby claimed pursuant to 37 CFR 1.78(a) (4) and (5)(i).

Reference is also made to U.S. patent application Ser. No. 13/014,762, filed Jan. 27, 2011, and entitled “AUTOMATIC RESOURCE OWNERSHIP ASSIGNMENT SYSTEMS AND METHODS”, the disclosure of which is hereby incorporated by reference and priority of which is hereby claimed pursuant to 37 CFR 1.78(a) (1) and (2)(i).

Reference is also made to U.S. patent application Ser. No. 13/106,023, filed May 12, 2011, and entitled “AUTOMATIC RESOURCE OWNERSHIP ASSIGNMENT SYSTEM AND METHOD”, the disclosure of which is hereby incorporated by reference and priority of which is hereby claimed pursuant to 37 CFR 1.78(a) (1) and (2)(i).

Reference is also made to the following patents and patent applications, owned by assignee, the disclosures of which are hereby incorporated by reference:

U.S. Pat. Nos. 7,555,482 and 7,606,801;

U.S. Published Patent Application Nos. 2007/0244899, 2008/0271157, 2009/0100058, 2009/0119298, 2009/0265780, 2011/0060916 and 2011/0061111; and

U.S. patent application Ser. No. 12/673,691.

FIELD OF THE INVENTION

The present invention relates to improved systems and methodologies for data tagging.

BACKGROUND OF THE INVENTION

The following patent publications are believed to represent the current state of the art:

U.S. Pat. Nos. 5,465,387; 5,899,991; 6,338,082; 6,393,468; 6,928,439; 7,031,984; 7,068,592; 7,403,925; 7,421,740; 7,555,482; 7,606,801 and 7,743,420; and

U.S. Published Patent Application Nos.: 2003/0051026; 2004/0249847; 2005/0108206; 2005/0203881; 2005/0086529; 2006/0064313; 2006/0184530; 2006/0184459; 2007/0203872; 2007/0244899; 2008/0271157; 2009/0100058; 2009/0119298 and 2009/0265780.

SUMMARY OF THE INVENTION

The present invention provides improved systems and methodologies for data tagging.

There is thus provided in accordance with a preferred embodiment of the present invention a method for characterizing data elements in an enterprise including ascertaining at least one of an access metric and a data identifier for each of a plurality of data elements and employing the at least one of an access metric and a data identifier to automatically apply a metatag to ones of the plurality of data elements.

Preferably, the method for characterizing data elements in an enterprise also includes ascertaining an owner for each of the plurality of data elements and requiring the owner to review and validate the metatags automatically applied to ones of the plurality of data elements of which he is the owner.

In accordance with a preferred embodiment of the present invention the employing includes automatically applying specific ones of a plurality of different metatags to specific ones of the plurality of data elements. Additionally or alternatively, the employing includes automatically applying to each one of the plurality of data elements a metatag previously applied to a parent folder thereof.

Preferably, the data identifier is one of file type, author, category and language.

In accordance with a preferred embodiment of the present invention the method for characterizing data elements in an enterprise also includes maintaining a database of access metrics for each of the plurality of data elements. Additionally or alternatively, the method for characterizing data elements in an enterprise also includes maintaining a database of data identifiers for each of the plurality of data elements.

Preferably, the employing includes employing an access metric and a data identifier to automatically apply a metatag to ones of the plurality of data elements. Alternatively, the employing includes employing an access metric to automatically apply a metatag to ones of the plurality of data elements. In another alternative embodiment the employing includes employing a data identifier to automatically apply a metatag to ones of the plurality of data elements.

There is also provided in accordance with another preferred embodiment of the present invention a method for characterizing data elements in an enterprise including ascertaining at least one of an access metric and a data identifier for each of a plurality of data elements and employing the at least one of an access metric and a data identifier to automatically recommend application of metatags to the plurality of data elements.

Preferably, the employing includes automatically recommending application of specific ones of a plurality of different metatags to specific ones of the plurality of data elements. Additionally or alternatively, the employing includes automatically recommending application, to each of the plurality of data elements, of a metatag previously applied to a parent folder thereof.

In accordance with a preferred embodiment of the present invention the method for characterizing data elements in an enterprise also includes ascertaining an owner for each of the plurality of data elements and requiring the owner to review and validate application of the recommended metatags to ones of the plurality of data elements of which he is the owner.

Preferably, the data identifier is one of file type, author, category and language.

In accordance with a preferred embodiment of the present invention the method for characterizing data elements in an enterprise also includes maintaining a database of access metrics for each of the plurality of data elements. Additionally or alternatively, the method for characterizing data elements in an enterprise also includes maintaining a database of data identifiers for each of the plurality of data elements.

Preferably, the employing includes employing an access metric and a data identifier to automatically recommend application of a metatag to ones of the plurality of data elements. Alternatively, the employing includes employing an access metric to automatically recommend application of metatags to the plurality of data elements. In another alternative embodiment, the employing includes employing a data identifier to automatically recommend application of metatags to the plurality of data elements.

There is yet further provided in accordance with still another preferred embodiment of the present invention a method for characterizing data elements in an enterprise including ascertaining an owner for each of a plurality of data elements and requiring the owner to apply at least one metatag to ones of the plurality of data elements of which he is the owner.

In accordance with a preferred embodiment of the present invention the method for characterizing data elements in an enterprise also includes maintaining a database of access metrics for each of the plurality of data elements. Additionally or alternatively, the method for characterizing data elements in an enterprise also includes maintaining a database of data identifiers for each of the plurality of data elements.

There is even further provided in accordance with a further preferred embodiment of the present invention a method for characterizing data elements in an enterprise including ascertaining an owner for each of a plurality of data elements and requiring the owner to review and validate metatags applied to ones of the plurality of data elements of which he is the owner.

In accordance with a preferred embodiment of the present invention the method for characterizing data elements in an enterprise also includes maintaining a database of access metrics for each of the plurality of data elements. Additionally or alternatively, the method for characterizing data elements in an enterprise also includes maintaining a database of data identifiers for each of the plurality of data elements.

There is also provided in accordance with yet another preferred embodiment of the present invention a method for characterizing data elements in an enterprise including ascertaining an owner for each of a plurality of data elements and automatically recommending application of metatags by the owner to the plurality of data elements of which he is the owner.

In accordance with a preferred embodiment of the present invention the method for characterizing data elements in an enterprise also includes maintaining a database of access metrics for each of the plurality of data elements. Additionally or alternatively, the method for characterizing data elements in an enterprise also includes maintaining a database of data identifiers for each of the plurality of data elements.

There is further provided in accordance with still another preferred embodiment of the present invention a method of operating a file system including maintaining a data owner/administrator accessible database of metatags assigned by data owners/administrators to a plurality of data elements; applying the metatags to the plurality of data elements in a storage platform; and automatically synchronizing the metatags applied to the plurality of data elements and the database.

There is still further provided in accordance with another preferred embodiment of the present invention a system for characterizing data elements in an enterprise including access metrics collection functionality operative to collect access metrics associated with a plurality of data elements, metadata collection functionality operative to collect metadata associated with the plurality of data elements and metatag application functionality operative to utilize the access metrics collection functionality and the metadata collection functionality to automatically employ at least one of an access metric and a data identifier to automatically apply a metatag to ones of the plurality of data elements.

Preferably, the system for characterizing data elements in an enterprise also includes metatag owner validation functionality operative to ascertain owners of each of the plurality of data elements and to require each of the owners to review and validate the metatags automatically applied to ones of the plurality of data elements of which he is the owner.

In accordance with a preferred embodiment of the present invention the metatag application functionality is also operative to automatically apply specific ones of a plurality of different metatags to specific ones of the plurality of data elements. Additionally or alternatively, the metatag application functionality is also operative to automatically apply to each one of the plurality of data elements a metatag previously applied to a parent folder thereof.

Preferably, the data identifier is one of file type, author, category and language.

In accordance with a preferred embodiment of the present invention the system for characterizing data elements in an enterprise also includes an access metrics database which stores the access metrics collected by the access metrics collection functionality. Additionally or alternatively, the system for characterizing data elements in an enterprise also includes a metadata database which stores the metadata collected by the metadata collection functionality.

Preferably, the metatag application functionality is also operative to utilize the access metrics collection functionality and the metadata collection functionality to automatically employ an access metric and a data identifier to automatically apply a metatag to ones of the plurality of data elements. Alternatively, the metatag application functionality is also operative to utilize the access metrics collection functionality to automatically employ an access metric to automatically apply a metatag to ones of the plurality of data elements. In another alternative embodiment, the metatag application functionality is also operative to utilize the metadata collection functionality to automatically employ a data identifier to automatically apply a metatag to ones of the plurality of data elements.

There is yet further provided in accordance with still another preferred embodiment of the present invention a system for characterizing data elements in an enterprise including access metrics collection functionality operative to collect access metrics associated with a plurality of data elements, metadata collection functionality operative to collect metadata associated with the plurality of data elements and metatag recommendation functionality operative to utilize the access metrics collection functionality and the metadata collection functionality to automatically employ at least one of an access metric and a data identifier to automatically recommend application of a metatag to ones of the plurality of data elements.

Preferably, the metatag recommendation functionality is also operative to automatically recommend application of specific ones of a plurality of different metatags to specific ones of the plurality of data elements. Additionally or alternatively, the metatag recommendation functionality is also operative to automatically recommend applying to each one of the plurality of data elements a metatag previously applied to a parent folder thereof.

In accordance with a preferred embodiment of the present invention the system for characterizing data elements in an enterprise also includes metatag owner validation functionality operative to ascertain owners of each of the plurality of data elements and to require each of the owners to review and validate application of the recommended metatags to ones of the plurality of data elements of which he is the owner.

Preferably, the data identifier is one of file type, author, category and language.

In accordance with a preferred embodiment of the present invention the system for characterizing data elements in an enterprise also includes an access metrics database which stores the access metrics collected by the access metrics collection functionality. Additionally or alternatively, the system for characterizing data elements in an enterprise also includes a metadata database which stores the metadata collected by the metadata collection functionality.

Preferably, the metatag recommendation functionality is also operative to utilize the access metrics collection functionality and the metadata collection functionality to automatically employ an access metric and a data identifier to automatically recommend application of a metatag to ones of the plurality of data elements. Alternatively, the metatag recommendation functionality is also operative to utilize the access metrics collection functionality to automatically employ an access metric to automatically recommend application of a metatag to ones of the plurality of data elements. In another alternative embodiment the metatag recommendation functionality is also operative to utilize the metadata collection functionality to automatically employ a data identifier to automatically recommend application of a metatag to ones of the plurality of data elements.

There is even further provided in accordance with yet another preferred embodiment of the present invention a system for characterizing data elements in an enterprise including metatag owner validation functionality operative to ascertain owners of each of a plurality of data elements and to require each of the owners to apply at least one metatag to ones of the plurality of data elements of which he is the owner.

Preferably, the system for characterizing data elements in an enterprise also includes an access metrics database which stores access metrics associated with the plurality of data elements. Additionally or alternatively, the system for characterizing data elements in an enterprise also includes a metadata database which stores metadata associated with the plurality of data elements.

There is also provided in accordance with still another preferred embodiment of the present invention a system for characterizing data elements in an enterprise including metatag owner validation functionality operative to ascertain owners of each of a plurality of data elements and to require each of the owners to review and validate application of metatags to ones of the plurality of data elements of which he is the owner.

In accordance with a preferred embodiment of the present invention the system for characterizing data elements in an enterprise also includes an access metrics database which stores access metrics associated with the plurality of data elements. Additionally or alternatively, the system for characterizing data elements in an enterprise also includes a metadata database which stores metadata associated with the plurality of data elements.

There is yet further provided in accordance with yet another preferred embodiment of the present invention a system for characterizing data elements in an enterprise including metatag owner validation functionality operative to ascertain owners of each of a plurality of data elements and to recommend application of metatags by each of the owners to ones of the plurality of data elements of which he is the owner.

Preferably, the system for characterizing data elements in an enterprise also includes an access metrics database which stores access metrics associated with the plurality of data elements. Additionally or alternatively, the system for characterizing data elements in an enterprise also includes a metadata database which stores metadata associated with the plurality of data elements.

There is still further provided in accordance with another preferred embodiment of the present invention a system for operating a file system including a data owner/administrator accessible database of metatags assigned by data owners/administrators to a plurality of data elements, metatag application functionality operative to apply the metatags to the plurality of data elements in a storage platform and synchronizing functionality operative to automatically synchronize the metatags applied to the plurality of data elements and the database.

BRIEF DESCRIPTION OF THE DRAWINGS

The present invention will be understood and appreciated more fully from the following detailed description, taken in conjunction with the drawings in which:

FIG. 1 is a simplified pictorial illustration of an example of the operation of an automatic data tagging system constructed and operative in accordance with a preferred embodiment of the present invention;

FIG. 2 is a simplified pictorial illustration of an example of the operation of an automatic data tagging system constructed and operative in accordance with another preferred embodiment of the present invention;

FIG. 3 is a simplified pictorial illustration of an example of the operation of an automatic data tagging system constructed and operative in accordance with yet another preferred embodiment of the present invention;

FIG. 4 is a simplified flowchart indicating steps in the operation of an automatic data tagging system constructed and operative in accordance with a preferred embodiment of the present invention;

FIG. 5 is a simplified flowchart indicating steps in the operation of an automatic data tagging system constructed and operative in accordance with another preferred embodiment of the present invention;

FIG. 6 is a simplified flowchart indicating steps in the operation of an automatic data tagging system constructed and operative in accordance with yet another preferred embodiment of the present invention; and

FIG. 7 is a simplified block diagram illustration of the automatic data tagging system whose functionality is illustrated in FIGS. 1-6.

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS

Reference is now made to FIG. 1, which is a simplified pictorial illustration of an example of the operation of an automatic data tagging system constructed and operative in accordance with a preferred embodiment of the present invention. The automatic data tagging system of FIG. 1 is preferably suitable for operating in an enterprise computer network including multiple disparate clients, data elements, computer hardware resources and computer software resources.

The operation of the automatic data tagging system of FIG. 1 preferably includes characterizing data elements in an enterprise by ascertaining at least one of an access metric and a data identifier for each of a plurality of data elements, and employing the at least one of an access metric and a data identifier to automatically apply a metatag to ones of the plurality of data elements. The operation of the automatic data tagging system also preferably includes ascertaining an owner for each of the plurality of data elements and requiring the owner to review and validate the metatags automatically applied to ones of the plurality of data elements of which he is the owner.

The term “data identifier” is used throughout to refer to metadata associated with a data element. The data identifier may be a content-based data identifier or a non content-based data identifier. A content-based data identifier associated with a data element preferably includes, for example, keywords or an abstract of the content of the data element. A non content-based data identifier associated with a data element preferably includes characteristics associated with the data element such as, for example, file type, author, category and language. A non content-based data identifier associated with a data element may also include one or more non content-based data identifiers associated with a parent folder of the data element. It is appreciated that the metadata may comprise predefined characteristics provided by the system which hosts the data elements or user-defined characteristics.
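By way of non-limiting illustration, the data identifier described above may be sketched as a simple record. The sketch below is an assumption made for clarity only: the class name DataIdentifier, its field names and the helper inherit_from_parent are hypothetical and do not appear in the specification, and the choice of which characteristics a data element takes from its parent folder (here author, category and language) is likewise an assumption.

```python
from dataclasses import dataclass, replace
from typing import Optional


@dataclass(frozen=True)
class DataIdentifier:
    """Metadata for one data element (hypothetical field names)."""
    # Non content-based characteristics named in the text.
    file_type: Optional[str] = None
    author: Optional[str] = None
    category: Optional[str] = None
    language: Optional[str] = None
    # Content-based characteristics named in the text.
    keywords: frozenset = frozenset()
    abstract: Optional[str] = None


def inherit_from_parent(child: DataIdentifier, parent: DataIdentifier) -> DataIdentifier:
    """Fill the child's missing author, category and language from its parent folder.

    Which fields are inherited is an assumption of this sketch; values the
    child already carries are never overwritten.
    """
    return replace(
        child,
        author=child.author or parent.author,
        category=child.category or parent.category,
        language=child.language or parent.language,
    )
```

In this sketch a file having no category of its own takes the category of its parent folder, while content-based characteristics such as keywords remain specific to the file.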

The term “metatag” is used throughout to refer to a metadata tag which is associated with a data element. Metatags are useful, for example, for automating data management tasks and for identifying data elements which may be grouped or categorized together for purposes of automatic or manual data management tasks.

The automatic data tagging system of FIG. 1 typically resides on a server 100 that is connected to an enterprise computer network 102 which preferably includes multiple disparate clients 104, servers 106 and data storage resources 108. Typically, data elements, such as computer files, reside on servers 106 and on data storage resources 108 and are accessible to users of the network 102 in accordance with access permissions defined by an owner of each data element or each data element folder. It is appreciated that the data elements may reside on any suitable data storage system or platform, such as a file system or a data collaboration system, which may reside on any suitable computer operating system or infrastructure.

Preferably, the system continuously maintains a database of actual access and access permissions of every user to every data element in the enterprise. This functionality is described in U.S. Pat. No. 7,606,801, in U.S. Published Patent Application 2009/0265780 and in U.S. patent application Ser. No. 12/673,691 owned by assignee, the disclosures of which are hereby incorporated by reference. Access permissions and/or actual access are together designated as access metrics and may be used to designate subsets of all of the data elements in the enterprise.

Preferably, the system also continuously crawls over at least a subset of all data elements in the enterprise and maintains a database of metadata associated with each of the subset of data elements.

As shown in FIG. 1, an IT Administrator of enterprise network 102 decides to utilize the automatic data tagging system residing on server 100 to automatically tag, as vulnerable files, a subset of files which grant access permissions to the ‘Everyone’ group and contain the term ‘confidential’. The Administrator then decides to send a list of the vulnerable files to their respective owners for access permissions remediation. In the example of FIG. 1 access permissions remediation may include, for example, modification of the access permissions of the file to include access permissions only to trusted individuals who require ongoing access to the file.
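The example of FIG. 1 may be sketched, purely for illustration, as a rule which combines an access metric (permissions granted to the ‘Everyone’ group) with a content-based data identifier (presence of the term ‘confidential’). The record type FileRecord, its fields and both function names are hypothetical, and the in-memory list stands in for the access metrics and metadata databases; none of these is taken from the specification.

```python
from dataclasses import dataclass, field


@dataclass
class FileRecord:
    """One data element as seen by the tagging rule (hypothetical fields)."""
    path: str
    owner: str
    permitted_groups: set   # access metric: groups holding access permissions
    text: str               # content assumed to have been gathered by the crawler
    metatags: set = field(default_factory=set)


def tag_vulnerable(files):
    """Apply the 'vulnerable' metatag to files that are open to 'Everyone'
    and contain the term 'confidential'; return the files that were tagged."""
    tagged = []
    for f in files:
        open_to_everyone = "Everyone" in f.permitted_groups
        mentions_confidential = "confidential" in f.text.lower()
        if open_to_everyone and mentions_confidential:
            f.metatags.add("vulnerable")
            tagged.append(f)
    return tagged


def remediation_lists(tagged):
    """Group the tagged file paths by owner, giving one list per owner."""
    by_owner = {}
    for f in tagged:
        by_owner.setdefault(f.owner, []).append(f.path)
    return by_owner
```

Each list returned by remediation_lists corresponds to the list of vulnerable files which the Administrator sends to the respective owner. The case-insensitive substring match is a simplification adopted for this sketch.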

Reference is now made to FIG. 2, which is a simplified pictorial illustration of an example of the operation of an automatic data tagging system constructed and operative in accordance with another preferred embodiment of the present invention. The automatic data tagging system of FIG. 2 is preferably suitable for operating in an enterprise computer network including multiple disparate clients, data elements, computer hardware resources and computer software resources.

The operation of the automatic data tagging system of FIG. 2 preferably includes characterizing data elements in an enterprise by ascertaining at least one of an access metric and a data identifier for each of a plurality of data elements, and employing the at least one of an access metric and a data identifier to automatically apply a metatag to ones of the plurality of data elements. The operation of the automatic data tagging system also preferably includes ascertaining an owner for each of the plurality of data elements, and requiring the owner to review and validate the metatags automatically applied to ones of the plurality of data elements of which he is the owner.

The automatic data tagging system of FIG. 2 typically resides on a server 200 that is connected to an enterprise computer network 202 which preferably includes multiple disparate clients 204, servers 206 and data storage resources 208. Typically, data elements, such as computer files, reside on servers 206 and on data storage resources 208 and are accessible to users of the network in accordance with access permissions defined by an owner of each data element or each data element folder. It is appreciated that the data elements may reside on any suitable data storage system or platform, such as a file system or a data collaboration system, which may reside on any suitable computer operating system or infrastructure.

Preferably, the system continuously maintains a database of actual access and access permissions of every user to every data element in the enterprise. This functionality is described in U.S. Pat. No. 7,606,801, in U.S. Published Patent Application 2009/0265780 and in U.S. patent application Ser. No. 12/673,691 owned by assignee, the disclosures of which are hereby incorporated by reference. Access permissions and/or actual access are together designated as access metrics and may be used to designate subsets of all of the data elements in the enterprise.

Preferably, the system also continuously crawls over at least a subset of all data elements in the enterprise and maintains a database of metadata associated with each of the subset of data elements.

As shown in FIG. 2, an IT Administrator of enterprise network 202 decides to utilize the automatic data tagging system residing on server 200 to automatically tag a subset of files which are owned by Dave, the company attorney, as being ‘legal’ files. The Administrator then decides to send a list of the legal files to Dave, requesting Dave to ascertain and confirm that the files tagged as ‘legal’ are actually legal-related files. As seen in FIG. 2, Dave ascertains and confirms that the files Contract1.doc and Agreement2.doc are actually legal-related files, while Resume5.doc is not legal-related, and therefore should not be tagged as ‘legal’.
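The review and validation step of FIG. 2 may be sketched as follows, it being appreciated that this is an illustrative assumption only. The function names propose_tag_by_owner and apply_owner_review and the plain dictionaries used in place of the system databases are hypothetical. The file names Contract1.doc, Agreement2.doc and Resume5.doc and the owner Dave are taken from the example above.

```python
def propose_tag_by_owner(owner_of, owner, tag):
    """Automatically tag every file owned by `owner` with `tag`.

    `owner_of` maps a file path to the name of its owner. The result maps
    each of the owner's files to the proposed metatag, pending review.
    """
    return {path: tag for path, o in owner_of.items() if o == owner}


def apply_owner_review(proposed, rejected_paths):
    """Keep only the proposed metatags which the owner did not reject."""
    return {path: tag for path, tag in proposed.items()
            if path not in rejected_paths}
```

In this sketch the owner's response is modeled as a set of rejected paths, so that a file which the owner does not mention keeps its automatically applied metatag. An implementation which instead requires explicit confirmation of each file would invert that default.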

Reference is now made to FIG. 3, which is a simplified pictorial illustration of an example of the operation of an automatic data tagging system constructed and operative in accordance with yet another preferred embodiment of the present invention. The automatic data tagging system of FIG. 3 is preferably suitable for operating in an enterprise computer network including multiple disparate clients, data elements, computer hardware resources and computer software resources.

The operation of the automatic data tagging system of FIG. 3 preferably includes characterizing data elements in an enterprise by ascertaining an owner for each of a plurality of data elements, and requiring the owner to apply metatags to ones of the plurality of data elements of which he is the owner.

The automatic data tagging system of FIG. 3 typically resides on a server 300 that is connected to an enterprise computer network 302 which preferably includes multiple disparate clients 304, servers 306 and data storage resources 308. Typically, data elements, such as computer files, reside on servers 306 and on data storage resources 308 and are accessible to users of the network in accordance with access permissions defined by an owner of each data element or each data element folder. It is appreciated that the data elements may reside on any suitable data storage system or platform, such as a file system or a data collaboration system, which may reside on any suitable computer operating system or infrastructure.

Preferably, the system continuously maintains a database of actual access and access permissions of every user to every data element in the enterprise. This functionality is described in U.S. Pat. No. 7,606,801, in U.S. Published Patent Application 2009/0265780 and in U.S. patent application Ser. No. 12/673,691 owned by assignee, the disclosures of which are hereby incorporated by reference. Access permissions and/or actual access are together designated as access metrics and may be used to designate subsets of all of the data elements in the enterprise.

Preferably, the system also continuously crawls over at least a subset of all data elements in the enterprise and maintains a database of metadata associated with each of the subset of data elements.

As shown in FIG. 3, an IT Administrator of enterprise network 302 decides to request that all owners of a subset of files manually tag the files. The Administrator utilizes the system residing on server 300 to automatically ascertain the owners of the files and to send a request to each owner to tag their respectively owned files. As seen in FIG. 3, upon receiving the request, each file owner tags their respectively owned files.
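The owner-driven tagging of FIG. 3 may be sketched, under stated assumptions, as two steps: building one tagging request per owner, and recording the metatags which each owner returns. The function names, the dictionary-based ownership lookup and the rule that an owner may tag only the files listed in his own request are assumptions of this sketch and are not taken from the specification.

```python
from collections import defaultdict


def build_tagging_requests(owner_of, subset):
    """Group the files of `subset` by owner, giving one request per owner.

    `owner_of` maps a file path to its ascertained owner. Files whose
    owner is not known are left out of every request.
    """
    requests = defaultdict(list)
    for path in subset:
        owner = owner_of.get(path)
        if owner is not None:
            requests[owner].append(path)
    return dict(requests)


def record_owner_tags(requests, responses):
    """Record the metatags returned by the owners.

    `responses` maps an owner to {path: metatag}. A metatag is accepted
    only for a file which was listed in that owner's own request.
    """
    applied = {}
    for owner, tags_by_path in responses.items():
        allowed = set(requests.get(owner, []))
        for path, tag in tags_by_path.items():
            if path in allowed:
                applied[path] = tag
    return applied
```

How a request is delivered to an owner, for example by electronic mail or through a user interface, is not addressed by this sketch.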

Reference is now made to FIG. 4, which is a simplified flowchartindicating steps in the operation of an automatic data tagging systemconstructed and operative in accordance with a preferred embodiment ofthe present invention. The automatic data tagging system of FIG. 4 ispreferably suitable for operating in an enterprise computer networkincluding multiple disparate clients, data elements, computer hardwareresources and computer software resources.

The operation of the automatic data tagging system of FIG. 4 preferablyincludes characterizing data elements in an enterprise by ascertainingat least one of an access metric and a data identifier for each of aplurality of data elements, and employing the at least one of an accessmetric and a data identifier to automatically apply a metatag to ones ofthe plurality of data elements. The operation of the automatic datatagging system also preferably includes ascertaining an owner for eachof the plurality of data elements, and requiring the owner to review andvalidate the metatags automatically applied to ones of the plurality ofdata elements of which he is the owner.

The automatic data tagging system of FIG. 4 typically resides on aserver that is connected to an enterprise computer network whichpreferably includes multiple disparate clients, servers and data storageresources. Typically, data elements, such as computer files, reside onservers and on data storage resources and are accessible to users of thenetwork in accordance with access permissions defined by an owner ofeach data element or each data element folder. It is appreciated thatthe data elements may reside on any suitable data storage system orplatform, such as a file system or a data collaboration system, whichmay reside on any suitable computer operating system or infrastructure.

Preferably, the system continuously maintains a database of actual access and access permissions of every user to every data element in the enterprise. This functionality is described in U.S. Pat. No. 7,606,801, in U.S. Published Patent Application 2009/0265780 and in U.S. patent application Ser. No. 12/673,691 owned by assignee, the disclosures of which are hereby incorporated by reference. Access permissions and/or actual access are together designated as access metrics and may be used to designate subsets of all of the data elements in the enterprise.

Preferably, the system also continuously crawls over at least a subset of all data elements in the enterprise and maintains a database of metadata associated with each of the subset of data elements.

As shown in FIG. 4, the system preferably continuously maintains a database of access metrics which include actual access and access permissions of every user to every data element in the enterprise (400). The system also preferably continuously crawls over at least a subset of all data elements in the enterprise and maintains a database of metadata associated with each of the subset of data elements (402). Preferably, the system utilizes the database of stored access metrics and the database of metadata to automatically apply a metatag to each of the subset of data elements (404). Alternatively, the system may automatically apply the metatag assigned to the parent folder of each of the subset of data elements to the data element.

Additionally, the system preferably ascertains an owner for each of the subset of data elements (406), and requires the owner of each of the subset of data elements to review and validate the metatag automatically applied to the data element (408).
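By way of non-limiting illustration only, the flow of FIG. 4 may be sketched in Python as follows. The sketch is not taken from the specification: the names `DataElement`, `TagRule`, `auto_tag` and `pending_reviews` are hypothetical, the rule format (one permitted group plus one data identifier implies one metatag) is an assumption, and the owner of each element is assumed to have been ascertained already.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class DataElement:
    path: str
    owner: str                      # assumed already ascertained (step 406)
    permitted_groups: frozenset     # access metric: who may access (step 400)
    metadata: dict                  # data identifiers from the crawl (step 402)
    metatag: Optional[str] = None
    validated: bool = False


@dataclass(frozen=True)
class TagRule:
    """Hypothetical rule: a permission plus a data identifier implies a metatag."""
    group: str
    identifier_key: str
    identifier_value: str
    metatag: str

    def matches(self, element: DataElement) -> bool:
        return (self.group in element.permitted_groups
                and element.metadata.get(self.identifier_key) == self.identifier_value)


def auto_tag(elements, rules, folder_tags=None):
    """Step 404: apply the first matching rule, else inherit the parent folder's tag."""
    folder_tags = folder_tags or {}
    for element in elements:
        rule = next((r for r in rules if r.matches(element)), None)
        if rule is not None:
            element.metatag = rule.metatag
        else:
            parent = element.path.rsplit("/", 1)[0]
            element.metatag = folder_tags.get(parent)
    return elements


def pending_reviews(elements):
    """Steps 406-408: group tagged but not yet validated elements by owner."""
    queue = {}
    for element in elements:
        if element.metatag is not None and not element.validated:
            queue.setdefault(element.owner, []).append(element.path)
    return queue
```

In this sketch, an element matching no rule and having no tagged parent folder is left untagged and therefore generates no review request. Combining rule matching with folder inheritance as a fallback is one possible reading of the alternative described above.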

Reference is now made to FIG. 5, which is a simplified flowchart indicating steps in the operation of an automatic data tagging system constructed and operative in accordance with another preferred embodiment of the present invention. The automatic data tagging system of FIG. 5 is preferably suitable for operating in an enterprise computer network including multiple disparate clients, data elements, computer hardware resources and computer software resources.

The operation of the automatic data tagging system of FIG. 5 preferably includes characterizing data elements in an enterprise by ascertaining at least one of an access metric and a data identifier for each of a plurality of data elements, and employing the at least one of an access metric and a data identifier to automatically recommend application of metatags to the plurality of data elements. Preferably, the recommending of application of metatags to the plurality of data elements includes automatically recommending application of specific ones of a plurality of different metatags to specific ones of said plurality of data elements.

The automatic data tagging system of FIG. 5 typically resides on a server that is connected to an enterprise computer network which preferably includes multiple disparate clients, servers and data storage resources. Typically, data elements, such as computer files, reside on servers and on data storage resources and are accessible to users of the network in accordance with access permissions defined by an owner of each data element or each data element folder. It is appreciated that the data elements may reside on any suitable data storage system or platform, such as a file system or a data collaboration system, which may reside on any suitable computer operating system or infrastructure.

Preferably, the system continuously maintains a database of actual access and access permissions of every user to every data element in the enterprise. This functionality is described in U.S. Pat. No. 7,606,801, in U.S. Published Patent Application 2009/0265780 and in U.S. patent application Ser. No. 12/673,691 owned by assignee, the disclosures of which are hereby incorporated by reference. Access permissions and/or actual access are together designated as access metrics and may be used to designate subsets of all of the data elements in the enterprise.

Preferably, the system also continuously crawls over at least a subset of all data elements in the enterprise and maintains a database of metadata associated with each of the subset of data elements.

As shown in FIG. 5, the system preferably continuously maintains a database of access metrics which include actual access and access permissions of every user to every data element in the enterprise (500). The system also preferably continuously crawls over at least a subset of all data elements in the enterprise and maintains a database of metadata associated with each of the subset of data elements (502). Preferably, the system utilizes the database of stored access metrics and the database of metadata to recommend applying at least one metatag to each of the subset of data elements (504).

Additionally, the system preferably ascertains an owner for each of the subset of data elements (506), and requires the owner of each of the subset of data elements to review and approve the recommendation to apply the at least one metatag to the data element (508).
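By way of non-limiting illustration only, the recommend-then-approve flow of FIG. 5 may be sketched as follows. The names `Recommendation`, `Status`, `recommend`, `review` and `applied_tags` are hypothetical, as is the caller-supplied `suggest` function that stands in for whatever logic maps access metrics and metadata to candidate metatags. The specification does not prescribe that logic.

```python
from dataclasses import dataclass
from enum import Enum


class Status(Enum):
    PENDING = "pending"
    APPROVED = "approved"
    REJECTED = "rejected"


@dataclass
class Recommendation:
    path: str
    owner: str
    metatag: str
    status: Status = Status.PENDING


def recommend(elements, suggest):
    """Step 504: produce one pending recommendation per suggested metatag.

    `elements` maps path -> (owner, access_metrics, metadata); `suggest` is a
    caller-supplied function returning an iterable of metatags for an element.
    """
    recs = []
    for path, (owner, access, metadata) in elements.items():
        for tag in suggest(access, metadata):
            recs.append(Recommendation(path, owner, tag))
    return recs


def review(rec, reviewer, approve):
    """Step 508: only the ascertained owner may approve or reject."""
    if reviewer != rec.owner:
        raise PermissionError(f"{reviewer} does not own {rec.path}")
    rec.status = Status.APPROVED if approve else Status.REJECTED
    return rec


def applied_tags(recs):
    """In this sketch, metatags take effect only after owner approval."""
    tags = {}
    for rec in recs:
        if rec.status is Status.APPROVED:
            tags.setdefault(rec.path, set()).add(rec.metatag)
    return tags
```

The difference from the FIG. 4 sketch is one of ordering: here nothing is applied until the owner approves, whereas in FIG. 4 the metatag is applied first and validated afterwards.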

Reference is now made to FIG. 6, which is a simplified flowchart indicating steps in the operation of an automatic data tagging system constructed and operative in accordance with yet another preferred embodiment of the present invention. The automatic data tagging system of FIG. 6 is preferably suitable for operating in an enterprise computer network including multiple disparate clients, data elements, computer hardware resources and computer software resources.

The operation of the automatic data tagging system of FIG. 6 preferably includes characterizing data elements in an enterprise by ascertaining an owner for each of a plurality of data elements, and requiring the owner to apply metatags to ones of the plurality of data elements of which he is the owner.

The automatic data tagging system of FIG. 6 typically resides on a server that is connected to an enterprise computer network which preferably includes multiple disparate clients, servers and data storage resources. Typically, data elements, such as computer files, reside on servers and on data storage resources and are accessible to users of the network in accordance with access permissions defined by an owner of each data element or each data element folder. It is appreciated that the data elements may reside on any suitable data storage system or platform, such as a file system or a data collaboration system, which may reside on any suitable computer operating system or infrastructure.

Preferably, the system continuously maintains a database of actual access and access permissions of every user to every data element in the enterprise. This functionality is described in U.S. Pat. No. 7,606,801, in U.S. Published Patent Application 2009/0265780 and in U.S. patent application Ser. No. 12/673,691 owned by assignee, the disclosures of which are hereby incorporated by reference. Access permissions and/or actual access are together designated as access metrics and may be used to designate subsets of all of the data elements in the enterprise.

Preferably, the system also continuously crawls over at least a subset of all data elements in the enterprise and maintains a database of metadata associated with each of the subset of data elements.

As shown in FIG. 6, the system preferably continuously maintains a database of access metrics which include actual access and access permissions of every user to every data element in the enterprise (600). Preferably, the system utilizes the database of access metrics to ascertain an owner for each of the data elements (602), and requires the owner of each of the data elements to apply at least one metatag to each of the data elements of which he is the owner (604).
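By way of non-limiting illustration only, steps 602 and 604 may be sketched as follows. The specification states that the owner is ascertained from the access metrics but does not state how. The heuristic below (the permitted user with the most recorded accesses) is purely an assumption made for the example, and the names `ascertain_owner` and `tagging_requests` are hypothetical.

```python
from collections import Counter


def ascertain_owner(access_events, permitted_users):
    """Step 602 (assumed heuristic): the permitted user with the most accesses.

    Ties are broken alphabetically so the result is deterministic.
    Returns None when no permitted user has accessed the element.
    """
    counts = Counter(user for user in access_events if user in permitted_users)
    if not counts:
        return None
    return min(counts, key=lambda user: (-counts[user], user))


def tagging_requests(access_log, permissions):
    """Step 604: map each owner to the elements that owner is required to tag."""
    requests = {}
    for path, events in access_log.items():
        owner = ascertain_owner(events, permissions.get(path, set()))
        if owner is not None:
            requests.setdefault(owner, []).append(path)
    return requests
```

For example, given an access log in which one permitted user accounts for most accesses to a file, `tagging_requests` lists that file under that user. Elements for which no owner can be ascertained are omitted here and would need separate handling.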

Reference is now made to FIG. 7, which is a simplified block diagram illustration of the automatic data tagging system whose functionality is illustrated in FIGS. 1-6. The automatic data tagging system 700 typically resides on a server 702 that is connected to an enterprise computer network 704 which preferably includes multiple disparate clients 706, servers 708 and data storage resources 710. Typically, data elements, such as computer files, reside on servers 708 and on data storage resources 710 and are accessible to users of the network in accordance with access permissions defined by an owner of each data element or each data element folder. It is appreciated that the data elements may reside on any suitable data storage system or platform, such as a file system or a data collaboration system, which may reside on any suitable computer operating system or infrastructure.

As shown in FIG. 7, the automatic data tagging system 700 comprises access metrics collection functionality 720 and metadata collection functionality 722. As described hereinabove regarding FIGS. 1-6, access metrics collection functionality 720 preferably stores access metrics in an access metrics database 724 and metadata collection functionality 722 preferably stores data element metadata in metadata database 726.

Metatag functionality 730 is preferably provided to utilize databases 724 and 726 to automatically apply metatags to data elements residing anywhere on network 704, as described hereinabove with regard to FIGS. 1-6. Metatag functionality 730 preferably includes metatag application functionality 732, which is operative to apply metatags to data elements, and metatag recommendation functionality 734, which is operative to recommend application of metatags to data elements. Metatag functionality 730 also preferably includes metatag owner validation functionality 736, which is operative to ascertain owners of data elements and to require the owners of the data elements to assign metatags to data elements or to validate recommendations of metatag assignment recommended by metatag recommendation functionality 734.
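By way of non-limiting illustration only, the composition of the blocks of FIG. 7 may be sketched as follows. The reference numerals in the comments map to the description above. The class and method names, the in-memory dictionaries standing in for databases 724 and 726, the `owners` mapping, and the single-user, single-file-type selection criterion are all assumptions made for the example.

```python
class AccessMetricsCollection:          # functionality 720
    def __init__(self):
        self.database = {}              # stands in for access metrics database 724

    def record(self, path, permitted_users):
        self.database[path] = set(permitted_users)


class MetadataCollection:               # functionality 722
    def __init__(self):
        self.database = {}              # stands in for metadata database 726

    def record(self, path, metadata):
        self.database[path] = dict(metadata)


class MetatagFunctionality:             # functionality 730
    def __init__(self, access, metadata, owners):
        self.access = access
        self.metadata = metadata
        self.owners = owners            # hypothetical path -> owner mapping
        self.recommended = {}           # recommendations awaiting owner validation
        self.applied = {}

    def recommend(self, user, file_type, metatag):      # 734
        """Recommend `metatag` for elements `user` may access that have `file_type`."""
        for path, users in self.access.database.items():
            meta = self.metadata.database.get(path, {})
            if user in users and meta.get("file_type") == file_type:
                self.recommended[path] = metatag

    def validate(self, path, reviewer):                 # 736, then 732
        """Apply a recommended metatag once the element's owner validates it."""
        if self.owners.get(path) != reviewer or path not in self.recommended:
            return False
        self.applied[path] = self.recommended.pop(path)
        return True
```

Keeping the two collection functionalities separate from the metatag functionality mirrors the block diagram: the former only populate the databases, while the latter only reads them.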

It will be appreciated by persons skilled in the art that the present invention is not limited by what has been particularly shown and described hereinabove. Rather, the invention also includes various combinations and subcombinations of the features described hereinabove as well as modifications and variations thereof, which would occur to persons skilled in the art upon reading the foregoing and which are not in the prior art.

The invention claimed is:
 1. A method for characterizing data elements, each of said data elements being accessible to users of an enterprise computer network in accordance with access permissions explicitly assigned by an assigned owner thereof, said method comprising: continuously maintaining a database of said access permissions explicitly assigned by said assigned owner; continuously maintaining a database of data identifiers associated with said plurality of data elements; specifying, by an administrator, at least one access permission explicitly assigned by said assigned owner and at least one data identifier; ascertaining which of said plurality of said data elements have both said at least one access permission explicitly assigned by said assigned owner and said at least one data identifier associated therewith; specifying, by an administrator, administrator defined metatags to be associated with each of said data elements ascertained to have both said at least one access permission explicitly assigned by said assigned owner and said at least one data identifier associated therewith; automatically applying a metatag from among said administrator defined metatags to ones of said plurality of data elements ascertained to have said at least one access permission explicitly assigned by said assigned owner and said at least one specific data identifier associated therewith; ascertaining an assigned owner for each one of said plurality of data elements ascertained to have said at least one access permission explicitly assigned by said assigned owner and said at least one data identifier associated therewith, said assigned owner having authority and accountability with respect to said one data element within said enterprise; and requiring said assigned owner to review and validate said metatags automatically applied to said ones of said plurality of data elements owned thereby.
 2. The method for characterizing data elements in an enterprise according to claim 1 and wherein said automatically applying a metatag comprises automatically applying specific ones of a plurality of different metatags to specific ones of said plurality of data elements.
 3. The method for characterizing data elements in an enterprise according to claim 1 and wherein said automatically applying a metatag comprises automatically applying to each one of said plurality of data elements a metatag previously applied to a parent folder thereof.
 4. The method for characterizing data elements in an enterprise according to claim 1 and wherein said data identifier is one of file type, author, category and language.
 5. The method for characterizing data elements in an enterprise according to claim 1 and wherein said automatically applying a metatag comprises automatically applying a metatag to ones of said plurality of data elements.
 6. A system having a computer comprising a processor, a memory and a non-transitory, tangible computer-readable medium in which computer program instructions are stored, which instructions, when executed by said processor, cause the computer to characterize data elements, each of said data elements being accessible to users of an enterprise computer network in accordance with access permissions explicitly assigned by an assigned owner thereof, said system comprising: access metrics collection functionality operative to collect said access permissions explicitly assigned by said assigned owner thereof; metadata collection functionality operative to collect data identifiers associated with said plurality of data elements; metatag specification functionality operative to facilitate specifying, by an administrator, administrator defined metatags to be associated with each of said data elements having both at least one access metric permission explicitly assigned by said assigned owner and at least one data identifier associated therewith; metatag application functionality operative to utilize said access metrics collection functionality and said metadata collection functionality to ascertain which of said plurality of said data elements have both said at least one access permission explicitly assigned by said assigned owner and said at least one specific data identifier associated therewith and to automatically apply a metatag from among said administrator defined metatags to ones of said plurality of data elements ascertained to have said at least one access permission explicitly assigned by said assigned owner and said at least one data identifier associated therewith; and metatag owner validation functionality operative to ascertain an assigned owner of each one of said plurality of data elements ascertained to have said at least one access permission explicitly assigned by said assigned owner and said at least one data identifier associated therewith, said assigned owner having authority and accountability with respect to said one data element within said enterprise, and to require each of said assigned owners to review and validate said metatags automatically applied to said ones of said plurality of data elements owned thereby.
 7. The system according to claim 6 and wherein said metatag application functionality is also operative to automatically apply specific ones of a plurality of different metatags to specific ones of said plurality of data elements.
 8. The system according to claim 6 and wherein said metatag application functionality is also operative to automatically apply to each one of said plurality of data elements a metatag previously applied to a parent folder thereof.
 9. The system according to claim 6 and wherein said data identifier is one of file type, author, category and language.
 10. The system according to claim 6 and also comprising a metadata database which stores said metadata collected by said metadata collection functionality.
 11. The system according to claim 6 and wherein said metatag application functionality is also operative to utilize said access metrics collection functionality and said metadata collection functionality to automatically employ said access metric and said data identifier to automatically apply a metatag to ones of said plurality of data elements.
 12. The system according to claim 6 and wherein said metatag application functionality is also operative to utilize said access metrics collection functionality to automatically employ said access metric to automatically apply a metatag to ones of said plurality of data elements.
 13. The system according to claim 6 and wherein said metatag application functionality is also operative to utilize said metadata collection functionality to automatically employ said data identifier to automatically apply a metatag to ones of said plurality of data elements.